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Abstract 

Background: The dose-response relationship is a fundamental pharmacological parameter necessary to determine 
therapeutic thresholds. Epi-allelic hypomorphic analysis using RNA interference (RNAi) can similarly correlate target 
gene dosage with cellular phenotypes. This however requires a set of RNAi triggers empirically determined to 
attenuate target gene expression to different levels. 

Results: In order to improve our ability to incorporate epi-allelic analysis into target validation studies, we 
developed a novel flow cytometry-based functional screening approach (CellSelectRNAi) to achieve unbiased 
selection of shRNAs from high-coverage libraries that knockdown target gene expression to predetermined levels. 
Employing a Gaussian probability model we calculated that knockdown efficiency is inferred from shRNA sequence 
frequency profiles derived from sorted hypomorphic cell populations. We used this approach to generate a 
hypomorphic epi-allelic cell series of shRNAs to reveal a functional threshold for the tumor suppressor p53 in 
normal and transformed cells. 

Conclusion: The unbiased CellSelectRNAi flow cytometry-based functional screening approach readily provides an 
epi-allelic series of shRNAs for graded reduction of target gene expression and improved phenotypic validation. 



Background 

Delineating gene function relies on the genetic principle 
established by decades of genetic mapping that mutations 
establish causality in phenotypes. Tra^s-dominant genetic 
effectors such as small molecule inhibitors and small inter- 
fering RNA (siRNA) that alter the expression and function 
of specific genes and proteins in trans, have dramatically 
enhanced our ability to correlate the extent of cell behav- 
ioral changes with the perturbation of gene expression and 
function. Pharmacological dose response studies correlate 
the degree of compound-induced inhibition of protein 
function with cellular responses. Similarly, hypomorphic 
alleles, mimicked by RNAi, have served as valuable tools 
for deciphering the roles of various genes in cell function 
[1,2]. Such epi-allelic hypomorphic analysis using different 
RNAi triggering sequences empirically determined to 
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knockdown gene expression to distinct levels, provides a 
unique opportunity to correlate gene expression levels 
with phenotypic outcome [3]. Hence, epi-allelic analysis 
can provide a dose response profile for putative thera- 
peutic targets in lieu of small molecule inhibitors. 

In order to improve our ability to conduct epi-allelic 
analysis, we developed a robust flow cytometry-based func- 
tional screening approach to enable unbiased selection of 
shRNAs that generate graded knockdown of target gene 
expression to predetermined levels. This genetic dose-re- 
sponse can be used to validate candidate drug targets and 
predict in vivo responses to inhibitor compounds. 

Methods 

Cell culture 

All cells were cultured in a humidified atmosphere at 37°C 
with 5% C0 2 . Phoenix A retroviral packaging cells (Dr. 
Garry Nolan, Stanford University, USA) and MDA-MB-231 
cells (ATCC) were maintained in DMEM (Gibco) and 
F-12 K nutrient mixture (Kaighn's modification) respectively, 
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both supplemented with 10% fetal bovine serum (FBS) 
(EuroClone, Italy), 2 mM L-glutamine, 100 U/ml penicillin 
and 100 (ig/ml streptomycin (all from Sigma- Aldrich) [4]. 
Leukemic cell line HL60 (gift from Dr. Bjorn T. Gjertsen; 
University of Bergen, Norway) were cultured in Iscoves 
medium containing 10% FBS, antibiotics and glutamine 
( Sigma- Aldrich) [5]. Primary human umbilical vein en- 
dothelial cells (HUVEC) (C2517A, Lonza, Switzerland) 
were maintained in the supplier s recommended complete 
medium (EGM-2). 

Retroviral transductions 

Phoenix A cells were transfected using the calcium phos- 
phate method [4]. Approximately 30 h after transfection, 
the medium was changed to the appropriate growth 
medium for the cells to be infected. Infectious supernatant 
was harvested 48 h after transfection. Target cells were 
spin-infected in virus -containing supernatant supplemen- 
ted with 5 (ig/ml protamine sulfate for 90 min at 1200 x g, 
and then incubated overnight before being returned to 
regular growth medium. Infected cells were selected with 
1 (ig/ml puromycin for 48 h. 

Immunostaining and flow cytometry 

The MDA-MB-231 cells were trypsinized using standard 
procedures and washed with 0.2% BSA/PBS. Cells were 
stained with anti-Axl antibody (MAB154; R&D Systems, 
USA) at a final concentration of 2 (ig/ml in 0.2% BSA/ 
PBS for 40 min. at room temperature. The cells were 
then washed twice in 0.2% BSA/PBS and incubated with 
secondary antibody (goat anti-mouse allophyocyanin- 
crosslinked; Molecular Probes, USA) at a final concen- 
tration of 0.2 (ig/ml for 30 min. at room temperature in 
the dark. The cells were washed twice with 0.2% BSA/ 
PBS and resuspended in 300 \A of 0.2% BSA/PBS before 
analysis on a FACSAria cell sorter (BD Biosciences, USA). 
Data analyses were carried out with Flowjo software (Tree 
Star, USA). 

Constructs 

Functional shRNAs that were identified from the library 
screen, were recloned into a new vector. The shRNAs 
were expressed from a modified human U6 promoter 
in the LTR of the retroviral vectors RRI-Red/L087 
(GenBank: EU424173). For sequence information on 
shAxl2, shAxl278, shAxl279, shAxl280 and shLuc see [2]. 

The retroviral expression vector pCSI expresses a 
puromycin-N-acetyl-transferase-EGFP and firefly lucif- 
erase cassette (Puro2A- GFP2ALuc2), with each open- 
reading frame separated by a linker encoding the 2A 
region (XXSGLRSGQLLNFDLLKLAGDVES- NPGP) 
from foot-and-mouth disease virus for stoichiometric 
co-translational polyprotein cleavage [6]. 



Construction of the reporter gene vector 

Reporter constructs were made by cloning the target 
cDNA (Axl: 2.9 kb EcoRI/BamHI fragment from IMAGE 
cDNA clone 5205825; p53: 1.5 kb PshAl/NotI fragment 
from pSVp53 (gift from B.T. Gjertsen) into the 3 '-un- 
translated region of retroviral vector L101 (Complete 
sequence in Additional file 1: Figure SI). 

shRNA library construction 

The construction of the Axl library has been described 
previously [2]. The p53 library (p53 cDNA kindly provided 
by BT Gjertsen, University of Bergen) was produced by a 
new method (see Additional file 2: Figure S2), generating 
a larger number of longer hairpins. p53 cDNA was frag- 
mented with each of the following frequently-cutting 
enzymes: HpaII/Hin6I (mixed), Alul, Dpnl, BsuRI, Rsal, 
HpyCH4V and blunt-ended with Klenow. The fragments 
in each digestion mix were then capped at each end by 
ligation with excess Hairpin oligo DK540 (PhoCTGCTG 
GATCCAGAGACTCCAGAATTGTGAGCGCTCACAA 
TTCTGGAGATGAGCCTTCGGTCTCATCTCTGGAT 
CCAGCAG). PvuII and an appropriate frequent-cutter 
enzyme were added to each mixture to inhibit unwanted 
side reactions. After overnight ligation, reactions were 
pooled, phenol extracted, chloroform extracted and size- 
selectively precipitated with 15% PEG 8000/0.1 M MgCl 2 / 
20 \ig glycogen to get rid of excess DK540 [7,8] Hairpin 
structures comprising 27 nt of p53-derived sequence 
joined to hairpin oligo DK540 were released from each 
end of the capped fragments by digestion with EcoP15I, 
dephosphorylated with Antarctic Phosphatase (NEB) and 
cleaned up by phenol/chloroform extraction and ethanol 
precipitation. Following blunt-ending with Klenow the 
hairpin loops (apparent MW approx 80 bp) were run on a 
2% agarose/5 mM sodium borate gel [9] and purified by 
electroelution onto GF/C paper (Whatman). 

The hairpins were cloned and converted into hairpin 
expression cassettes in a single step: Dephosphorylated hair- 
pins were ligated to Nael/Narl-cut L267 pEntr-U6NaeIXcmI, 
a kanamycin resistant derivative of pDONR221 carrying a 
Pacl-U6 promoter-Termination signal-Hindlll cassette, 
and then cut with Xcml. The relatively small size of this 
vector compared to retroviral vector L087 simplifies the 
cloning and gives improved transformation efficiencies. 
The product of ligation contains a nick at the 5 ' end of 
the hairpin due to the lack of 5' phosphate. Extension 
from the free 3 'OH at this nick with a strand-displacing 
polymerase (Klenow) opens out the hairpin structure into 
dsDNA and at the same time removes the single nucleo- 
tide 3' Xcml overhang. The construct was then recir- 
cularised and electroporated into competent cells and 
plated. This primary library (many thousand colonies) was 
scraped up and plasmid DNA isolated. The size of the 
hairpin loop was reduced by digestion with Gsul, blunt- 
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ending with Klenow, gel-purification, recircularisation and 
re-cloning. Finally, the complete U6:hairpin:termination 
signal cassette was transferred into retroviral vector L297, 
a slight modification of vector L087. Restriction enzymes 
and polymerases were from New England Biolabs or 
Fermentas. 

Isolation of p53 targeting shRNAs 

To select shRNAs that target human p53 mRNA from 
the approximately 100 unique shRNA sequences, we 
prepared virus supernatants from the p53 shRNA library 
and control shLuc vectors as described above. Three 
independent populations of 2 x 10 5 HL60 RNAi p53 
reporter cells were infected with virus at a transduction 
efficiency below 10% to ensure favor single retroviral inte- 
grations per cell, with approximately 200 independent in- 
tegrations per shRNA contstruct. After 48 h of Puromycin 
selection and another 2 days of culture, mCherry positive 
(as a marker for library infected cells) cells displaying 
reduced GFP levels were sorted using a FACSAria cell 
sorter. One hundred cells per well were sorted directly 
into a 96-well PCR microplate (FS-C, Axygen Bioscienes) 
containing 10 \i\ lx Thermopol buffer/proteinase K 
(100 (ig/ml) (Sigma- Aldrich) per well. After sorting, 
the PCR plate was incubated for 30 min at 37°C, and then 
the enzyme was inactivated at 95°C for 10 min. Each well 
was added 10 \A 2x PCR mix (see below) to a final volume 
of 20 \il 

Amplification of shRNA pools from sorted cells 
PCR mixture 

The PCR mixture for amplifying vector genomic DNA 
in a total volume of 20 \A contained 1.3 U and 0.1 U of 
Taq/Pfu DNA Polymerase (Fermentas), lx Thermopol 
buffer (containing 20 mM of Tris-HCl (pH 8.8), 10 mM 
of KC1, 10 mM of (NH4) 2S04, 2 mM of MgS04, 0.1% 
of Triton X-100) (NEB), 0.25 \iM of each primer, 
0.375 mM dNTP mix (Fermentas). The reaction mixture 
for the second amplification round was the same as for 
the first one, except that 0.5 \iM of the "inner" primers 
with tags were used instead of the "outer" primers. In 
the second amplification 2 \A of the first round PCR 
product was used and the reaction was also supple- 
mented with 2.5 mM MgCl 2 . See Table 1 for primer 
information. 

Nested PCR 

PCR thermal cycles were carried out using a thermal 
cycler (PTC- 100, MJ Research) to amplify vector genomic 
DNA using the following sequences: preheat at 95°C for 
5 min; 35 cycles of 95°C for 30 s, 58°C for 20 s and 72°C 
for 40 s. After 35 cycles of first round PCR, the second 
round PCR thermal cycles were performed: preheat at 
95°C for 2 min; 20 cycles of 95°C for 20 s, 56°C for 



20 s and 72°C for 30 s; additional extension at 72°C 
for 5 min to end the amplification for both programs. 
DNA was analyzed by 2% agarose gel electrophoresis and 
stained by ethidium bromide. 9 \A of each nested PCR 
were pooled, run on agarose gel and the PCR band 
(ca. 280 nt) was purified with Illustria GFX PCR DNA 
and gel band purification kit (GE Healthcare) before 454 
sequencing (Roche) at The Norwegian High-Throughput 
Sequencing Centre (University of Oslo). 

454 sequence data analysis 

Raw FASTA format 454 sequence data were analysed 
using a series of simple Perl scripts (D. Micklem, unpub- 
lished). Initially, the sequences were scanned to deter- 
mine orientation (sequencing off A or B-primer) and to 
identify the multiplex tag associated with each primer. 
Primer and constant vector sequences were then deleted 
and empty/too short sequences excluded. Remaining se- 
quences were tested to determine whether they encoded 
a hairpin structure. Finally, the sequences were clustered 
into groups of near-identical sequences and the number 
of times each sequence was seen in each tag-group was 
tabulated. 

Immunoblotting 

Cells were lysed in RIPA buffer (25 mM Tris-HCl 
pH 7.6, 150 mM NaCl, 1% NP-40, 1% sodium deoxycho- 
late, 0.1% SDS) supplemented with protease inhibitor 
(13457200; Roche). After addition of lysis buffer, the ex- 
tract was incubated for 5 min on ice, and then passed 
through a 29 gauge syringe. The suspension was centri- 
fuged for 10 min at 10000 rpm and the supernatant pro- 
tein concentration was measured using a BCA protein 
assay kit (#23227, Pierce). SDS/PAGE (Novex NuPAGE 
Bis-Tris 4-12% gels and MES buffer) and protein transfer 
were carried out according to standard procedures given 
by the manufacturer (Life Technologies). 20 \ig protein 
was loaded on the gel. Western blots were developed 
with ECL substrate (#32106, Pierce) and the signal was 
detected and analyzed using the BioRad ChemiDoc 
XRS + system. p53 was detected using mouse monoclonal 
anti-human p53 antibody (abllOl, Abeam) and Glyceral- 
dehyde 3-phosphate dehydrogenase (GAPDH) loading 
control was detected with Millipore MAB379 antibody 
(clone 6C5). Secondary antibody Goat anti-mouse (GAM) 
IgG HRP (#G21040, Invitrogen). 

Intensity of individual bands was quantified using ImageJ 
densitometry software, andexpressed relative to GAPDH 
signal, as a measure of protein relative abundance in the 
different samples. 

Irradiation 

L108 Luc, shp53-3, shp53-10 and shp53-13 HUVEC 
cells were harvested by trypsination and replated in 
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Table 1 Primers used in the amplification of shRNA pools from sorted cell 



Primer sequence 



Id 



Info 



CCGAGGTGGGCAGTCAATCAATCT 
ACGCCA^GCAAGGCAT 
454-adaptor primers 

Lowercase letters provide a multiplexing code to allow separate PCR reactions 

GCCTCCCTCGCGCCATCAGagcmOTGGGTAGmGCAGm 

GCCTCCCTCGCGCCATCAGtgamOTGGGTAGmGCAGm 

GCCTCCCTCGCGCCATCAGcagmOTGGGTAGmGCAGm 

Type A primers are identical except for the 3 nt's in small letters 

GCOTGCCAGCCCGCTCAGtactgacTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtacgatcTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtacagtcTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtagactcTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtagcatcTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtacgtacTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtagctacTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGtagtcacTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcactgagTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcacgatgTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcacagtgTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcagactgTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcagcatgTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcacgtagTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcagatcgTOTACAAGAAAGCTGGGTAAG 

GCOTGCCAGCCCGCTCAGcagtcagTOTACAAGAAAGCTGGGTAAG 

Type B primers are identical except the 7 nt's in small letters. 



DK633outerr 
DK637altouterf 

to be pooled and sequenced together 
DK621 ACodeAGC 
DK621 ACodeTGA 
DK621 ACodeCAG 

DK622BCodeTActgaC 
DK622BCodeTAcgatC 
DK622BCodeTAcagtC 
DK622BCodeTAgactC 
DK622BCodeTAgcatC 
DK622BCodeTAcgtaC 
DK622BCodeTAgctaC 
DK622BCodeTAgtcaC 
DK622BCodeCActgaG 
DK622BCodeCAcgatG 
DK622BCodeCAcagtG 
DK622BCodeCAgactG 
DK622BCodeCAgcatG 
DK622BCodeCAcgtaG 
DK622BCodeCAgatcG 
DK622BCodeCAgtcaG 



outer primer 
outer primer 



type, nested 
type, nested 
type, nested 

type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 
type, nested 



12- or 24- well plates 24 h before irradiation. The cells 
were irradiated on the plate with 2 or 8 Gy using a 
Philips RT 100 X-ray machine with an acceleration 
voltage of 100 kV. 

mRNA extraction 

Irradiated cells were lysed by addition of 360 \A Buffer 
MRL (Qiagen)/well directly to the 24-well plates. mRNA 
was extracted and purified with a MagAttract Direct 
mRNA M48 Kit with oligo(dT) covered magnetic beads 
on a GenoM-48 Robotic Workstation (Qiagen). Standard 
settings for mRNA extraction were used with 100 \i\ 
elution volumes. 

cDNA synthesis 

10 \A of mRNA was used as template for cDNA synthesis 
in a 20 \A reaction. The reaction buffer consisted of First 
Strand Buffer xl, 10 mM dithiothretiol (DTT) and 5 U/\A 
SuperScriptTM II Reverse Transcriptase (Invitrogen) and 1 
\J/\A Protector RNase Inhibitor, 20 pmole/|il Hexanucleotide 



Mix and 0.25 mM of each dNTP, Li-Salt (Roche Diagnos- 
tics). The RT reaction was performed on a PTC-200 Peltier 
Thermal Cycler (MJ Research) at 22°C for 10 min, 42°C for 
45 min and 99°C for 3 min. 

Quantitative PCR 

TaqMan® Gene Expression Assays for p53, p21 and 
POLR2a (Pol II) were purchased from Applied Biosys- 
tems (Table 2). All primer products span exon-exon 



Table 2 Gene expression assays used for quantitative PCR 

Gene Exon-exon Cat. number 

boundary 

Cyclin-dependent kinase inhibitor 2-3 Hs00355782_m1 
1 A CDKN1 A/P21 

Tumor protein p53 TP53/p53 9-10 Hs00153349_m1 

Polymerase (RNA) II (DNA directed) 1 -2 Hs001 721 87_m1 

polypeptide A POLR2A 

All primers/probes were purchased from Applied Biosystems. 
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junctions to avoid amplification of genomic DNA. QPCR 
reactions were performed using 5 \A of cDNA in Micro- 
Amp™ 96-well optical microtiter plates on a 7900HT 
Fast QPCR System in TaqMan® Fast Universal PCR 
Master Mix (Applied Biosystems) according to protocol 
(total volume 25 \A). cDNA was diluted 10 times prior 
to qPCR and all samples were run in duplicate. PCR 
results were analyzed with SDS 2.3 software (Applied 
Biosystems) and relative quantity was determined using 
the AACT Method [10], with DMSO-treated cells as 
calibrator and POLR2A as endogenous reference. 



Gamma-H2AX staining 

Gamma-H2AX detection was performed essentially as 
previously described [11]. H2AX is a form of Histone 2 A 
that is phosphorylated upon DNA double strand break 
induction (gamma-H2AX) and can thus be used as a 
measure of DNA double strand breaks after irradiation. 
HUVEC cells were seeded in 12-well plates 24 h pre- 
irradiation. In brief, after irradiation the cells were 
incubated at 37°C for 30 min to allow complete H2AX 
phosphorylation and then kept on ice during the whole 
procedure to avoid phosphatase activity. After trypsination 
and harvest, the cells were incubated with Block-9 staining 
buffer containing anti-phospho-H2AX FITC-conjugated 
antibody (Millipore) and Vybrant Dye Cycle Violet stain 
(Invitrogen, Molecular Probes) for 3 h on ice. The in- 
crease in phosphorylated H2AX was then detected on a 
FACSAria flow cytometer (BD Biosciences). Excitation 
wavelengths were 488 nm/405 nm respectively and emis- 
sion was detected with filter/bandpass 450/40 for FITC 
and 530/30 for Vybrant Dye Cycle Violet Stain. 

EdU staining 

HUVEC cells were seeded out in 24-well plates 24 h 
pre-radiation. After irradiation, the cells were harves- 
ted by trypsination, diluted 1:8 and reseeded into 6-well 
plates. After growing for 48 h, EdU was added to the 
cells at a concentration of 10 |iM. EdU assay, as a 
measure of cellular proliferation, was performed es- 
sentially as earlier described (Ref: Flow cytometry methods 
for DNA-damage induced clonogenic cell death calibrated 
to the colony forming assay, Yue Gao, Pegah Johansson, 
Aida Muslimovic and Ola Hammarsten, submitted manu- 
script), using a Click-iT® EdU Alexa Fluor® 488 Imaging 
Kit (Invitrogen). In brief, after 16 h incubation with 
EdU, the cells were washed with PBS and harvested 
by trypsination. After a 0.1% BSA/PBS wash, the cells 
were fixed for 15 min. The cells were washed again 
in BSA/PBS and then permeabilized for 30 min. After 
another wash with BSA/PBS, the cells were incubated 
in staining buffer according to protocol for 30 min, 



kept from light. After a final wash in lx component 
E, the cells were incubated with Ribonuclease A for 
30 min. Before Analysis on a FACSAria flow cytometer 
(BD Biosciences), the cells were diluted in 300 \i\ 
BSA/PBS. 



Data analysis 

Statistical analysis was carried out using ANOVA 
followed by Tukeys post hoc tests (GraphPad 4.0 
Software, San Diego, CA). Significance was assumed 
for p value less than 0.05. Sigmoidal best fit curve 
generated with sigmoidal fit equation at http://zunzun.com, 
using the Hill with offset equation: ax A b/(c A b + x A b) + 
Offset. 

Results and discussion 

In an effort to derive RNAi-competent sequences from a 
target gene we prepared an shRNA-expressing retroviral 
library based on random 21-26 nt sequences (-250 se- 
quences) derived from the human Axl gene sequence, a 
receptor tyrosine kinase frequently expressed in human 
cancer and correlated with poor overall survival [12]. 
The shRNA library was transduced into MDA-MB-231 
breast carcinoma cells; individual cells with reduced Axl 
cell surface expression were isolated by FACS and the 
cognate shRNA sequences identified by PCR and deep 
sequencing. We noted that each sequence displayed a 
unique, reproducible gene knockdown level that was 
not predictable by analysis in available RNAi sequence 
algorithms (Additional file 3: Figure S3). Collectively, 
these Axl-targeting shRNAs comprised an epi-allelic 
hypomorphic series in MDA-MB-231 cells that vali- 
dated the role of Axl in metastasis in vitro and 
in vivo [2]. 

Based on these results we developed an approach to 
rapidly identify functional shRNA sequences from high- 
coverage gene specific shRNA libraries to enable epi-allelic 
hypomorphic analysis of putative therapeutic targets [1]. 
The natural fluctuation in protein levels routinely dis- 
played by flow cytometric analysis can be modeled by nor- 
mal probability distributions [13]. Hence, a cell population 
expressing an shRNA library against a target gene can be 
represented as a continuous distribution comprising mul- 
tiple samples. The probability that a cell will be found 
below a given threshold is proportional to the difference 
in target gene expression means of the hypomorphic and 
wild type cell populations (Additional file 4: Figure S4). 
Importantly, this leads to the prediction that the number 
of cells expressing a specific shRNA found below this 
threshold will be related to the potency of the shRNA. 
This implies that the frequency of different shRNA 
sequences recovered from a sorted cell population should 
be proportional to their knockdown levels. 
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Figure 1 The CellSelectRNAi approach, (a) CellSelectRNAi principle. HL60 reporter cells are transduced with a library of candidate shRNAs 

targeting gene of interest. Each cell is interrogated by flow cytometry; cells with reduced reporter levels are sorted for subsequent PCR and 

shRNA sequence identification by 454 deep sequencing. Sequences are plotted by frequency representation in the sorted population, (b) 

Schematic of the CellSelectRNAi system. The RNAi reporter comprises a destabilized GFP (dsGFP), with the untranslated target gene cloned 

downstream of a translational stop codon (*). In the scenario on the left, a RISC complex is loaded with an effective shRNA that will bind to the 

target mRNA sequence in the reporter mRNA and initiate mRNA cleavage and prevention of dsGFP reporter translation. In the scenario on the 

right, the RISC complex is loaded with an ineffective shRNA that is unable to bind the target mRNA. The efficiency of each shRNA is proportional 

to the amount of dsGFP in the cell, and can be measured by flow cytometry, (c) RNAi reporter correlates with endogeneous gene expression. 

Flow cytometric analysis showing the level of RNAi reporter and endogeneous Axl cell surface protein in each epi-allelic MDA-MB-231 cell line 

demonstrates a linear relationship between RNAi reporter intensity and Axl protein stained with Axl antibody. Each point on the graph represents 

the geometric mean and the graph is representative of four individual experiments. The error bars represent 95% confidence-limit on the mean, 
k ' J 
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Figure 2 (See legend on next page.) 
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(See figure on previous page.) 

Figure 2 Generation of an shp53 epi-allelic series, (a) HL60 p53-RNAi reporter cell line. A HL60 cell line was transduced with p53-RNAi 
reporter construct and a single cell-derived clone was isolated by FACS, expanded and reanalysed for optimal (GFP) reporter expression by flow 
cytometry, (b) HL60 p53-RNAi reporter cells transduced with shp53 library show a population with reduced RNAi reporter expression. HL60 
p53-RNAi reporter cells were transduced with a library of candidate p53 shRNA or control shRNA (shLuc). All cells were monitored by FACS and 
cells with reporter levels within the GFP| OW /RFP + gate {box) were sorted, (c) Functionally selected shRNA sequence frequency profile. Sequences 
retrieved from the 454 deep sequencing were tested to determine whether they encoded a hairpin structure and clustered into groups of 
near-identical sequences, and plotted by frequency, (d) Generalized linear model (GLM) fit of shRNA sequence frequency versus RNAi reporter 
level reveals a statistical relationship. Representative shp53-sequences were recloned, transduced into HL60 p53-RNAi reporter cells and analyzed 
by flow cytometry. A generalized linear model (GLM) with a log link function ('log(frequency) ~ 1 + knockdown') and Poisson distribution was 
applied to model the shRNA frequency versus RNAi reporter fluorescence intensity, generating estimated coefficients listed in Table 2, Additional 
file 4: Figure S4. The GLM model shows an exponential relationship between the observed shRNA frequency and measured RNAi reporter at a 
significance level of a = 0.05 (p~0). The reporter intensity data represents the mean of three independent experiments, (e) HUVE cells transduced 
with different shp53 clones generates an epi-allelic series. Transduced HUVE cells expressing different shp53 shRNA sequences show graded p53 
expression levels as measured by relative quantification of Western blot versus mRNA levels. 



To test this prediction, we devised a FACS -based 
screening approach to identify functional RNAi-triggers 
(CellSelectRNAi; Figure la). Libraries of candidate retrovi- 
rally expressed shRNAs are transduced into a population 
of host cells stably expressing a dedicated RNAi-GFP 
reporter and sorted for target gene knockdown by FACS. 
shRNA sequences are identified from the sorted cells 
by PCR and deep sequencing, and assembled into a fre- 
quency profile. The frequency of a given shRNA should 
correlate with knockdown efficiency. The RNAi- GFP 
reporter, comprising an open reading frame encoding a 
destabilized GFP [14], fused to hygromycin phosphotrans- 
ferase (hpt) together with the untranslated target cDNA 
sequence serving as a 3'-UTR may be used to identify 
functional shRNAs for any gene (Figure lb). To validate 
this RNAi-reporter system we prepared an Axl RNAi- 
reporter MDA-MB-231 cell line that was transduced sep- 
arately with four different Axl shRNAs (shAxl2, shAxl280, 
shAxl279, shAxl278) in addition to a control shRNA 
targeting lucif erase (shLuc). These five cell lines were 
simultaneously analyzed for reporter and endogenous Axl 
surface receptor protein levels by multicolor flow cyto- 
metry. Axl RNAi-reporter GFP fluorescence correlated 
strongly (R 2 = 0.9931) with endogeneous surface Axl re- 
ceptor protein (Figure lc) in the epi-allelic MDA-MB-231 
cell series. 

To evaluate the CellSelectRNAi approach, we generated 
p53 RNAi-reporter expressing HL60 cells. Flow cyto- 
metric analysis of the HL60/p53-RNAi reporter cell line 
displayed dsGFP fluorescence with a mean fluorescence 
intensity of 1721 and coefficient of variation (CV) of 36.8 
(Figure 2a). A high coverage p53 shRNA library was 
prepared using a novel cloning approach where random 
sequences derived from the p53 mRNA sequence are 
converted into a form that encodes a hairpin structure 
and cloned into a U6 promoter cassette within a self- 
inactivating retroviral vector (Additional file 5: Figure S5). 



The p53 shRNA library (approximately 100 unique sequences) 
was transduced into the HL60/p53-RNAi reporter cell line 
and analyzed by FACS (Figure 2b). A low transduction 
rate (-10%) was used to favor single retroviral integrations 
per cell. A sorting gate was set at 2 standard deviations 
from the mean of the wild type population and HL60/ 
p53-RNAi reporter cells carrying p53-targeting shRNAs 
(RFP + ) within the sorting gate were isolated (Figure 2b). 
1800 sorting events were collected and proviral PCR amp- 
lification was conducted to generate a -280 bp product 
that was analyzed by massively parallel pyrosequencing. 
The resulting 14790 shRNA sequences, of which 32 were 
p53 shRNA (Additional file 6: Figure S6) were rank 
ordered by frequency to generate a distribution plot 
(Figure 2c). Eight p53 shRNA sequences representing 
high, intermediate and low frequency levels were recloned 
and reintroduced into the HL60/p53-RNAi reporter cell 
line and monitored for reporter knockdown by flow 
cytometry. As predicted by the continuous distribution 
model, the shRNA sequence frequency derived from the 
sorted population showed a clear statistical relationship 
with the reporter intensity (Figure 2d). The most prevalent 
shRNA sequences identified by the CellSelectRNAi ap- 
proach were the most potent also when measured at 
endogenous protein level and mRNA level (Figure 2e, 
Additional file 7: Figure S7). These results demonstrate 
accurate prediction of effective shRNA sequences by the 
CellSelectRNAi approach. Importantly, the most potent 
p53 shRNAs identified in the library screen are not 
predicted by available RNAi sequence algorithms [15] 
(Additional file 3: Figure S3). 

We conducted epi-allelic hypomorphic analysis using 
selected p53-targeting shRNAs to determine the thresh- 
old of p53-dependent responses to ionizing radiation in 
primary human endothelial cells. p53 is a transcription 
factor involved in the DNA damage response. p53 acti- 
vation by DNA damage, such as upon ionizing radiation, 
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(See figure on previous page.) 

Figure 3 Epi-allelic hypomorphic analysis to determine the threshold of p53-dependent responses to ionizing radiation, (a) Non-linear 
correlation between p53 and p21 mRNA levels in p53 epi-allelic primary human endothelial cells. Graded p53 knockdown in human umbilical 
cord endothelial cells (HUVEC) with shp53-3, shp53-10 shp53-13 or negative control (shLuc) {left panel) corresponded to a non-linear reduction in 
p21 gene expression {right panel). Data represent the mean ± SD of three (p53) and five (p21) measurements representative for two independent 
experiments relative to control cells (set as 100%). Statistical analysis was carried out using ANOVA followed by Tukey's post hoc tests, *P < 0,05. 
(b) Non-linear correlation between p53 and p21 mRNA levels in p53 epi-allelic primary human endothelial cells after irradiation. Irradiation of 
p53-epi-allelic HUVEC with 2 or 8 Gy did not affect p53 mRNA levels but increased p21 mRNA expression in a non-linear, p53-dependent manner. 
The data represents the mean of duplicate samples ± range, (c) Ionizing radiation induced H2AX phosphorylation in p53 epi-allelic primary 
human endothelial cells. H2AX phosphorylation, a measure of DNA double strand break induction, in HUVEC, correlates with greater p53 
knockdown. The data represents the mean of duplicate samples ± range, (d) Cell proliferation in p53 epi-allelic primary human endothelial cells 
with shRNA against p53 after irradiation. The number of EdU positive p53-epi-allelic HUVEC following ionizing radiation, as a measure of cell 
division, increases with p53 knockdown efficiency. The data represents the mean of duplicate samples ± range, (e) Dose response relationship 
between p53-levels and radiation-induced p21 activation by epi-allelic analysis. Ionizing radiation-induced p21 activation displays a sigmoidal 
dose-dependency on p53 gene expression. 

V ) 



results in increased expression of several inducible re- 
sponse genes. p21 is an example of a p53-regulated gene 
which is induced following ionizing radiation and was 
used here to assess the effects of shRNA-mediated 
knockdown on p53 activity [16]. As shown in Figure 3a-b, 
graded p53 knockdown in primary human endothelial 
cells resulted in a non-linear attenuation of p21 -activation 
following ionizing radiation. Radiation-induced double- 
strand break repair (histone H2AX phosphorylation) and 
DNA synthesis were similarly diminished in endothelial 
cells with less than 50% of wild type p53 expression levels 
(Figure 3c-d). Intriguingly, the observed dose-dependency 
of ionizing radiation-induced p21 -activation on p53 gene 
expression levels fits a sigmoidal dose response model 
with an apparent IC50 around 50% p53-expression, 
the expected gene dosage of a p53 +/ " heterozygote 
(Figure 3e). 

Conclusion 

These results demonstrate that flow cytometric selection 
of RNAi knockdown cells combined with deep sequence 
analysis can be used to identify multiple shRNA se- 
quences that reduce target gene expression to predeter- 
mined levels. Perhaps surprisingly, the most potent 
functionally-selected sequences do not follow widely 
used RNAi scoring algorithms and are therefore not 
represented in current RNAi collections. The ability to 
readily generate several shRNAs per gene carries signifi- 
cant advantages. In addition to obviating the influence of 
"off-target" effects [17,18], a set of shRNAs with differing 
silencing potentials provides an epi-allelic series of hypo- 
morphs, a genetic dose-response, that can uniquely 
relate gene expression to functional response thresh- 
olds and improve target-phenotype correlations [1,2]. 
Epi-allelic analysis highlights the "robustness" that 
buffers perturbations on biological systems and delin- 
eates drug target inhibition profiles for therapeutic 
translation [19]. 



Additional files 



Additional file 1: Figure SI. Retroviral vector L101 complete sequence. 

Additional file 2: Figure S2. (A) L267 cloning vector with details of 
polylinker. Hairpins are expressed under the control of a modified U6 
promoter, and cloned between the Nael at the transcription start and the 
Xcml at the transcription terminator. Pacl/Hindlll restriction enzyme sites 
to facilitate cloning of the completed cassette into the final vector. (B) 
Structure of Hairpin Oligo DK540, including EcoP15l site (blue), Gsul sites 
(red) and a half-Pvull site (green). The loop sequence present in the 
completed hairpin cassette is in yellow. (C) Hairpin Oligo DK540 ligated 
to each end of a short fragment of p53. Digestion with EcoP13l releases 
two hairpin-tagged sequences which are then blunt-ended with Klenow 
fragment and dephosphorylated. (D) Nael/Xcml-digested vector is ligated 
to haripin-tagged sequences. Because the haripin tag lacks a 5' phosphate, 
the ligation product will have a 'nick' in the DNA phosphate backbone as 
indicated. DNA polymerisation from this nick using a strand-displacing 
polymerase opens out the hairpin into dsDNA. (E) Subsequent digestion 
with Gsul and recircularisation shortens the hairpin loop. 

Additional file 3: Figure S3. Functionally selected shRNA sequences 
fail in RNAi scoring algorithms. Hairpin sequences from this study that 
gave good knockdown efficiency (actual efficiency) were scored with 
eight different RNAi scoring algorithms using the i-Score server 
(http://www.med.nagoya-u.ac.jp/neurogenetics/i_Score/i_score.html). Bold 
italic, score within top 100 for gene. Git: guide strand-loop-template strand 
orientation. Tig: template strand-loop-guide strand orientation. 

Additional file 4: Figure S4. Predictions of knock-out effect. 

Additional file 5: Figure S5. Schematic drawing of the hairpin 
expression cassette. shRNAs are produced from the U6/tethracyclin- 
inducible (TetO) promoter. The U6 promoter can transcribe short RNAs 
by RNA Polymerase III. This expression cassette also contains genes 
coding for mCherry and Puromycine separated by a linker encoding the 2A 
self-cleaving sequence under the control of the EF1a promoter. 
Abbreviations: pA, polyadenylation signal; uro, puromycin; Y, packaging 
signal; T, terminator. 

Additional file 6: Figure S6. Sequences that have been 

scored as hairpins in screen. Lowercase letters signify hairpin loop. 

Additional file 7: Figure S7. Epiallelic HUVE cells comprising 
p53-targeting shRNA's show graded p53 expression levels by 
immunoblotting. 



Abbreviations 

dsGFP: Destabilized GFP; FACS: Fluorescence activated cell sorter; GFP: Green 
fluorescent protein; HUVEC: Human umbilical vein endothelial cell; LTR: Long 
terminal repeat; nt: Nucleotides; RFP: Red fluorescent protein; RNAi: RNA 
interference; shRNA: Short hairpin RNA. 



Micklem et al. BMC Biotechnology 2014, 14:57 
http://www.biomedcentral.eom/1 472-6750/1 4/57 



Page 1 1 of 1 1 



Competing interests 

M.B. and D.R.M. are current employees of BerGenBio AS. J.B.L. is a founder, 
equity stockholder and consultant for BerGenBio AS. Aspects of the 
methodology presented are part of a patent (U. S. Patent No.: 8,735,064). 

Authors' contributions 

DRM, MB, OH and JBL designed the study; DRM and JBL developed the 
shRNA methodology; MB, PB, CT, TH and CG performed experiments; EH, PB 
and MB performed the statistical analysis; JBL, DRM and MB wrote the 
manuscript. All authors have read and approved the final manuscript. 

Acknowledgements 

This work was supported by a grant from the Norwegian Research Council 
to J.B.L, a University of Bergen predoctoral fellowship to M.B. and 
postdoctoral fellowship to DRM. The funders had no role in study design, 
data collection and analysis, decision to publish, or preparation of the 
manuscript. 

Author details 

department of Biomedicine, University of Bergen, N-5009 Bergen, Norway. 
2 BerGenBio A/S, N-5009, Bergen, Norway, department of Clinical Chemistry 
and Transfusion Medicine, Sahlgrenska University Hospital, S-413 45 
Goteborg, Sweden. 

Received: 9 December 2013 Accepted: 18 June 2014 
Published: 21 June 2014 



14. Li X, Zhao X, Fang Y, Jiang X, Duong T, Fan C, Huang CC, Kain SR: 
Generation of destabilized green fluorescent protein as a transcription 
reporter. J Biol Chem 1998, 273:34970-34975. 

15. Ichihara M, Murakumo Y, Masuda A, Matsuura T, Asai N, Jijiwa M, Ishida M, 
Shinmi J, Yatsuya H, Qiao S, Takahashi M, Ohno K: Thermodynamic 
instability of siRNA duplex is a prerequisite for dependable prediction of 
siRNA activities. Nucleic Acids Res 2007, 35:e123. 

16. Prives C, Hall PA: The p53 pathway. J Pathol 1999, 187:1 12-126. 

1 7. Nature Editorial: Whither RNAi? Nat Cell Biol 2003, 5:489-490. 

18. Cullen BR: Enhancing and confirming the specificity of RNAi experiments. 
NatMeth 2006, 3:677-681. 

19. Friedman A, Perrimon N: Genetic screening for signal transduction in the 
era of network biology. Cell 2007, 128:225-231. 



doi:1 0.1 1 86/1 472-6750-1 4-57 

Cite this article as: Micklem et al.: Flow cytometry-based functional 
selection of RNA interference triggers for efficient epi-allelic analysis of 
therapeutic targets. BMC Biotechnology 2014 14:57. 



References 



10. 



12. 



13. 



Hemann MT, Fridman JS, Zilfou JT, Hernando E, Paddison PJ, Cordon-Cardo 
C, Hannon GJ, Lowe SW: An epi-allelic series of p53 hypomorphs created 
by stable RNAi produces distinct tumor phenotypes in vivo. Nat Genet 
2003, 33:396-400. 

Gjerdrum C, Tiron C, Hoiby T, Stefansson I, Haugen H, Sandal T, Collett K, Li 
S, McCormack E, Gjertsen B, Micklem D, Akslen L, Glackin C, Lorens J: Axl is 
an essential epithelial-to-mesenchymal transition-induced regulator of 
breast cancer metastasis and patient survival. Proc Natl Acad Sci 2010, 
107:1124-1129. 

Wille L, Kemp ML, Sandy P, Lewis CL, Lauffenburger DA: Epi-allelic Erkl and 
Erk2 knockdown series for quantitative analysis of T cell Erk regulation 
and IL-2 production. Mol Immunol 2007, 44:3085-3091. 
Swift SE, Lorens JB, Achacouso P, Nolan GP: Rapid Production of 
Retroviruses for Efficient Gene Delivery to Mammalin Cells using 293T 
Cell-Based Systems. In Current Protocols in Immunology. Edited by Coligan 
JE, Kruisbeek AM, Margulies DH, Shevach EM, Strober W. New York: Wiley; 
1999:1-17. 

Gallagher R, Collins S, Trujillo J, McCredie K, Ahearn M, Tsai S, Metzgar R, 
Aulakh G, Ting R, Ruscetti F, Gallo R: Characterization of the continuous, 
differentiating myeloid cell line (HL-60) from a patient with acute 
promyelocytic leukemia. Blood 1979, 54:713-733. 
Lorens JB, Pearsall DM, Swift SE, Peelle B, Armstrong R, Demo SD, Ferrick 
DA, Hitoshi Y, Payan DG, Anderson D: Stable, stoichiometric delivery of 
diverse protein functions. J Biochem Biophys Methods 2004, 58:101-1 10. 
Paithankar KR, Prasad KSN: Precipitation of DNA by polyethylene glycol 
and ethanol. Nucleic Acids Res 1991, 19:1346. 
Schmitz A, Riesner D: Purification of nucleic acids by selective 
precipitation with polyethylene glycol 6000. Anal Biochem 2006, 
354:311-313. 

Brody J, Kern S: Sodium boric acid: a Tris-free, cooler conductive medium 
for DNA electrophoresis. Biotechniques 2004, 36:214-216. 
Livak KJ, Schmittgen TD: Analysis of relative gene expression data using 
real-time quantitative PCR and the 2-[Delta][Delta]CT method. 

Methods 2001, 25:402-408. 

Muslimovic A, Ismail IH, Gao Y, Hammarsten O: An optimized method for 
measurement of gamma-H2AX in blood mononuclear and cultured cells. 

Nat Protocols 2008, 3:1187-1193. 

Linger RMA, Keating AK, Earp HS, Graham DK: Taking aim at Mer and Axl 
receptor tyrosine kinases as novel therapeutic targets in solid tumors. 

Expert Opin Ther Targets 2010, 14:1073-1090. 
Hoffmann M, Chang HH, Huang S, Ingber DE, Loeffler M, Galle J: 
Noise-driven stem cell and progenitor population dynamics. PLoS ONE 
2008, 3:e2922. 



Submit your next manuscript to BioMed Central 
and take full advantage of: 

• Convenient online submission 

• Thorough peer review 

• No space constraints or color figure charges 

• Immediate publication on acceptance 

• Inclusion in PubMed, CAS, Scopus and Google Scholar 

• Research which is freely available for redistribution 



Submit your manuscript at 
www.biomedcentral.com/submit 



(3 BioMed Central 



v. 



